German regional variants - a problem for automatic speech recognition?
Authors
Abstract
A well-known problem in automatic speech recognition (ASR) is robustness against the variability of speech between speakers. There are several ways to normalise for differences between speakers; one of them is to address regional variation. In this paper we discuss whether moderate regional variants of German influence the automatic speech recognition process and whether performance can be improved through knowledge of the regional origin of the unknown speaker. The basic idea of our experiment is to cluster test speakers into distinct dialectal regions and to derive observations about the typical pronunciation within these regions from a classified training set. In a cheating experiment, in which the origin of the test speakers is known, we verify whether the use of dialect-specific pronunciation forms improves the overall performance of the recognizer. It turns out that simply using dialect-specific pronunciation does not significantly improve word accuracy on the VERBMOBIL 1996 task.
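The basic idea described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the function names `build_regional_lexicons` and `lookup`, the region labels, and the toy pronunciation strings are all invented here, and picking the single most frequent variant per region is an assumed simplification of "deriving observations about typical pronunciation".

```python
from collections import Counter, defaultdict


def build_regional_lexicons(observations):
    """Derive one pronunciation per (region, word) from labelled training data.

    observations: iterable of (region, word, pronunciation) tuples.
    Assumption: the most frequent variant seen in a region is taken as
    that region's typical pronunciation.
    """
    counts = defaultdict(lambda: defaultdict(Counter))
    for region, word, pron in observations:
        counts[region][word][pron] += 1

    lexicons = {}
    for region, words in counts.items():
        lexicons[region] = {
            word: variants.most_common(1)[0][0]
            for word, variants in words.items()
        }
    return lexicons


def lookup(lexicons, canonical, region, word):
    """Return the region-specific form if one was observed,
    otherwise fall back to the canonical pronunciation."""
    return lexicons.get(region, {}).get(word, canonical[word])


# Toy data, invented purely for illustration (not taken from any corpus).
observations = [
    ("north", "ich", "I C"),
    ("north", "ich", "I C"),
    ("south", "ich", "I S"),
    ("south", "ich", "I"),
    ("south", "ich", "I"),
]
canonical = {"ich": "I C", "nicht": "n I C t"}

lexicons = build_regional_lexicons(observations)

# "Cheating" setup: the speaker's region is assumed to be known in advance.
print(lookup(lexicons, canonical, "south", "ich"))
print(lookup(lexicons, canonical, "south", "nicht"))
```

In the cheating setup, the region passed to `lookup` is given rather than estimated; a real system would additionally need a way to classify the unknown speaker's region.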
Similar resources
RVG 1 - A Database for Regional Variants of Contemporary German
Regional speaker variability is a major problem in today's state-of-the-art speech recognition systems. Therefore, a major point in the creation of speech resources is the regional coverage of data within one language. At the beginning of 1996 we started to collect data for the RVG1 (Regional Variants of German) corpus. This project was established in cooperation between the American telephone c...
Towards a Localised German Automatic Speech Recognition
Spoken languages are often rich in regional accents and dialects. These local variations often pose challenges to automatic speech recognition. In this study, we analyse the influence of German regional accents on the performance of a large vocabulary continuous speech recogniser trained on standard German data. The experiments show a large variation in the error rate over different regions. We...
Regional Pronunciation Variants for Automatic Segmentation
The goal of this paper is to create an extended rule corpus with approximately 2300 phonetic rules which model segmental variation of regional variants of German. The phonetic rules express, at a broad-phonetic level, phenomena of phonetic reduction in German that occur within words and across word boundaries. In order to get an improvement in automatic segmentation of regional speech variants, ...
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making man-machine interaction more natural. Since speech is the most popular method of communication, recognizing human emotions from the speech signal has become a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have attracted the attention of researchers. Among these, lip-reading techniques have encountered many challenges for speech recognition, that one of the challenges b...
Publication date: 1998